PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Thecc1EG001348t1
Common Name TCM_001348
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family Trihelix
Protein Properties Length: 472 aa    MW: 54351.8 Da    pI: 6.0009
Description Trihelix family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG001348t1  genome  CGD     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  94.6   9.5e-30  40     124  1          87
          trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                       rW++qe+laL+++r++m+ ++r++  k+plWeevs+k++e g++rs+k+Ckek+en+ k+++++keg+++r+++  +++++f+qlea
  Thecc1EG001348t1  40 RWPRQETLALLKIRSDMDVAFRDSGVKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHRRTKEGRSGRSNG--KNYRFFEQLEA 124
                       8********************************************************************98554..47*******85 PP

2    trihelix  104.5  7.9e-33  304    389  1          87
          trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                       rW+k+ev aLi++r +++ +++++  k+plWee+s +m++ g+ rs+k+Ckekwen+nk++k++ke++kkr +e+s+tcpyf+ql+a
  Thecc1EG001348t1 304 RWPKDEVEALIRLRANLDLQYQDNGPKGPLWEEISTAMKKLGYDRSAKRCKEKWENMNKYFKRVKESNKKR-PEDSKTCPYFHQLDA 389
                       8*********************************************************************8.99***********85 PP
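
The Start and End columns for the two trihelix hits can be cross-checked against the alignment rows themselves. The sketch below is only an illustration: the names `HITS`, `ungapped_length`, and `span_matches` are hypothetical, and it assumes the coordinates are inclusive and that `-` is the only gap character in the query row.

```python
# Query rows copied from the two trihelix alignments above; "-" marks a gap.
# Each tuple is (start, end, aligned_query), with start/end as listed in the table.
HITS = [
    (40, 124,
     "RWPRQETLALLKIRSDMDVAFRDSGVKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHRRTKEGRSGRSNG--KNYRFFEQLEA"),
    (304, 389,
     "RWPKDEVEALIRLRANLDLQYQDNGPKGPLWEEISTAMKKLGYDRSAKRCKEKWENMNKYFKRVKESNKKR-PEDSKTCPYFHQLDA"),
]


def ungapped_length(aligned):
    """Count residues in an aligned row, ignoring gap characters."""
    return len(aligned.replace("-", ""))


def span_matches(start, end, aligned):
    """True if an inclusive start..end span covers exactly the ungapped residues."""
    return end - start + 1 == ungapped_length(aligned)


for start, end, aligned in HITS:
    print(start, end, ungapped_length(aligned), span_matches(start, end, aligned))
```

Both rows span the full 87 columns of the HMM once gaps are counted, which agrees with the HMM Start and HMM End values of 1 and 87.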

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50090           7.178     33     97   IPR017877    Myb-like domain
SMART            SM00717           0.0019    37     99   IPR001005    SANT/Myb domain
CDD              cd12203           8.71E-24  39     104  No hit       No description
Pfam             PF13837           1.4E-19   39     125  No hit       No description
PROSITE profile  PS50090           7.84      297    361  IPR017877    Myb-like domain
Gene3D           G3DSA:1.10.10.60  2.5E-5    301    361  IPR009057    Homeodomain-like
SMART            SM00717           1.6E-4    301    363  IPR001005    SANT/Myb domain
SuperFamily      SSF46689          4.02E-5   301    373  IPR009057    Homeodomain-like
CDD              cd12203           2.89E-25  303    368  No hit       No description
Pfam             PF13837           2.9E-24   303    390  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 472 aa
MMENSGFPEN NTVADNVSLE NEEEVTVKNE ESERNFPGNR WPRQETLALL KIRSDMDVAF  60
RDSGVKAPLW EEVSRKLAEL GYNRSAKKCK EKFENIYKYH RRTKEGRSGR SNGKNYRFFE  120
QLEALDHHPS LLPPATGHIN TSMQPFSVIR DAIPCSIRNP VLSFNETSAS TTSSSGKESD  180
GMRKKKRKLT EFFGRLMREV MEKQENLQKK FIEAIEKSEQ DRMAREEAWK MQELDRIKRE  240
RELLVQERSI AAAKDAAVLA FLQKFSDQAT SVRLPETPFP VEKVVERQEN SNGSESYMHL  300
SSSRWPKDEV EALIRLRANL DLQYQDNGPK GPLWEEISTA MKKLGYDRSA KRCKEKWENM  360
NKYFKRVKES NKKRPEDSKT CPYFHQLDAL YKEKTKRGDG SVNSGYELKP EELLMHMMSA  420
PDERPHQESV TEDGESENAD QNQEENGNAE EEEGDAYQIV ANDPSPMAII G*
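
The blocked listing above mixes residues with spacing, running position counts, and a terminal `*`. As a minimal sketch of how it might be flattened for downstream tools, the code below strips that layout and re-emits FASTA. The names `LISTING`, `parse_listing`, and `to_fasta` are hypothetical. Note that the listing holds 471 residue letters plus the stop symbol, so the stated length of 472 appears to count the `*`; that reading is an inference from this record, not documented database behavior.

```python
import re

# Sequence listing copied from the section above: blocks of 10 residues,
# a running position count after each full line, and "*" for the stop.
LISTING = """
MMENSGFPEN NTVADNVSLE NEEEVTVKNE ESERNFPGNR WPRQETLALL KIRSDMDVAF  60
RDSGVKAPLW EEVSRKLAEL GYNRSAKKCK EKFENIYKYH RRTKEGRSGR SNGKNYRFFE  120
QLEALDHHPS LLPPATGHIN TSMQPFSVIR DAIPCSIRNP VLSFNETSAS TTSSSGKESD  180
GMRKKKRKLT EFFGRLMREV MEKQENLQKK FIEAIEKSEQ DRMAREEAWK MQELDRIKRE  240
RELLVQERSI AAAKDAAVLA FLQKFSDQAT SVRLPETPFP VEKVVERQEN SNGSESYMHL  300
SSSRWPKDEV EALIRLRANL DLQYQDNGPK GPLWEEISTA MKKLGYDRSA KRCKEKWENM  360
NKYFKRVKES NKKRPEDSKT CPYFHQLDAL YKEKTKRGDG SVNSGYELKP EELLMHMMSA  420
PDERPHQESV TEDGESENAD QNQEENGNAE EEEGDAYQIV ANDPSPMAII G*
"""


def parse_listing(text):
    """Return (residues, has_stop) from a blocked protein listing."""
    # Drop spaces, newlines, and the numeric position counters.
    symbols = re.sub(r"[\s\d]", "", text)
    has_stop = symbols.endswith("*")
    return symbols.rstrip("*"), has_stop


def to_fasta(identifier, residues, width=60):
    """Format residues as a FASTA record wrapped at `width` columns."""
    lines = [residues[i:i + width] for i in range(0, len(residues), width)]
    return ">" + identifier + "\n" + "\n".join(lines)


residues, has_stop = parse_listing(LISTING)
print(len(residues), has_stop)
print(to_fasta("Thecc1EG001348t1", residues))
```

With the flattened string in hand, a 1-based inclusive span such as the first trihelix hit (40 to 124) corresponds to the Python slice `residues[39:124]`.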
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    183    188  KKKRKL
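
The position of the KKKRKL motif can be checked against the sequence listing. The following is a small sketch with hypothetical names (`SEGMENT`, `SEGMENT_START`, `locate`); it uses only the listing line that covers residues 181 to 240. Under 1-based numbering the motif falls at 184 to 189, so the 183 and 188 reported in the table look like zero-based offsets. This is an observation from this one record and should be treated as an assumption about the coordinate convention.

```python
# Residues 181-240 of the protein, copied from the sequence listing
# (the line ending at position 240), with the block spacing removed.
SEGMENT_START = 181  # 1-based position of the first residue in SEGMENT
SEGMENT = "GMRKKKRKLTEFFGRLMREVMEKQENLQKKFIEAIEKSEQDRMAREEAWKMQELDRIKRE"
MOTIF = "KKKRKL"


def locate(motif, segment, segment_start):
    """Return the 1-based inclusive (start, end) of the first motif match, or None."""
    offset = segment.find(motif)
    if offset < 0:
        return None
    start = segment_start + offset
    return start, start + len(motif) - 1


one_based = locate(MOTIF, SEGMENT, SEGMENT_START)
# Shift both ends down by one to express the same span as zero-based offsets.
zero_based = (one_based[0] - 1, one_based[1] - 1)
print(one_based, zero_based)
```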
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_007048236.1      0.0      Duplicated homeodomain-like superfamily protein, putative
Swissprot  Q39117              1e-127   TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBL     A0A061DR08          0.0      A0A061DR08_THECC; Duplicated homeodomain-like superfamily protein, putative
STRING     POPTR_0001s31660.1  0.0      (Populus trichocarpa)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM5995              28           47
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76880.1  1e-126   Trihelix family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]